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Description 

The present invention is concerned with an immunochemical reagent and with polypeptide mimicking 
amino acid sequences of AIDS-related viral proteins. 

5 The disease AIDS (acquired immune deficiency syndrome) only recently has been recognized as a 
rapidly spreading affliction with often dramatic and devastating consequences for the victims. It has also 
been recognized that AIDS is spread by viral infections and this causative virus has been named HTLV-III 
(Science, 224, 497-508, 1984). Alternative names like LAV and ARV-2 were also coined by various 
investigators to name their isolates of the virus causing AIDS. In the present application the name HTLV-III 

10 will be used throughout to encompass also LAV, ARV-2 and all other variants causing AIDS. The thus 
named variants were shown to be very closely related with regard to their genomic organization. 

Nevertheless, by cloning and molecular characterization of HTLV-III, it was shown also that independent 
isolates of the virus have substantial diversity of DNA sequence, as reflected in the differences of their 
restriction enzyme cleavage patterns. These observations, combined with DNA sequence analysis, have led 

75 to the general belief that point mutants responsible for genetic diversity and genetic drift in HTLV-III may 
contribute to the pathogenicity of this virus. Such genetic drift has been shown to account for increased 
pathogenicity of Visna virus isolates (Clements et al., PNAS, 77: 4454-4458, 1980). Likewise, recurrent 
febrile episodes of encephalitis caused by equine encephalitis virus may be the result of mutations arising 
in the infected population (Montelaro et al., Journal of Biological Chemistry, 259: 10539-10544, 1980). The 

20 virus HTLV-III, although shown to be distinct from other RNA viruses by nucleic acid sequence determina- 
tion, is clearly a member of the retrovirus family. 

The general genomic organisation of HTLV-III viral nucleic acid is shown in Figure 1. This figure shows 
three principal genes of the virus. Gag is the group capsid antigens precursor gene; pol is the reverse 
transcriptase gene; and env is the gene of the precursor to the envelope glycoproteins. Two" additional open 

25 reading frames (ORFs),~iabelled 1 and 2, are indicated. The gene of the transactivating factor of HTLV-III 
corresponds to ORF1. The function of any gene product from ORF2 is unknown. The boxed areas at the 
ends of Figure 1 represent the viral long terminal repeats (LTRs). 

The products of the three principal viral genes can be characterized briefly as follows: 

30 I. gag Gene Products . 

Positioned at the 5 f end of the genome, the gag open reading frame sequence encodes a 55K M r - 
(molecular mass) precursor protein. This precursoFls" processed into three mature proteins. Amino acids 1 
through 132 correspond to a 17K virion protein and is probably involved in morphogenesis. The next mature 
35 protein is the major capsid structural protein, p24. The remainder of the gag poly-protein (p15), a highly 
basic protein, is probably involved in nucleic acid binding. ~ 

II. pol Gene Products 

40 The predicted amino acid sequence of the second long open reading frame from HTLV-III shows 
homology and similarity with other pol genes of other retroviruses. This 3,044 nucleotides long open 
reading frame has the potential to~cbde for a 1,015 amino acid precursor protein for the reverse 
transcriptase present in HTLV-III virion. In addition to the 5' reverse transcriptase information, there exists 3' 
genetic information probably involved in endonuclease and integrase functions necessary for viral replica- 

45 tion. 

III. env Gene Products 

The env-lor region of HTLV-III encodes a precursor protein about 863 amino acids long. This primary 
so gene product, M r 160K, can be detected in infected cells. This precursor is cleaved into two products, a 
hydrophilic exterior protein (gp120) that is glycosylated and a hydrophobic transmembrane segment. The 
virion gp41 corresponds to the transmembrane protein. There is a high degree of conservation of amino 
acid sequence at the transmembrane protein processing site. The regions outside of this are less well 
conserved and show considerable difference at the nucleotide and amino acid sequence level when 
55 different viral isolates are compared. 

Once various isolates of HTLV-III had been cloned by different investigators, segments of the genome, 
as defined by DNA sequence analysis, were targeted for genetic engineering. Published work includes 
genetic engineering and expression in E.coli of varying segments of the env-gene of HTLV-III. Viral isolate 
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BH-10 was expressed in open reading frame, vectors (Chang et ai., Science , 228 : 93-95, 1985; Chang et al., 
Biotechnology , 3: 905-909, 1985). Similarly, the env-gene of viral isolate HXB^fwas cloned and expressed 
in E.coli by Crowl et al. (Cell, 41: 979-986, 1985). 

The expression in E.coli of a portion of the LAV genome encoding gag proteins and the use of these 
5 proteins in vaccines, is described in WO 86/06099. EP 181150 describes the expression of hTLR DNA 
sequences, e.g. the sequences encoding gag p25 and gag p16, in E.coli and the use of the recombinant 
proteins for diagnostic purposes. 

WO86/06099 and EP 181150, however, do not mention recombinant gp41 and gp160 env gene products. 
In order to halt the presently observed vigorous spreading of AIDS, routine testing of blood for the 
w presence of the virus or antibodies against it, was introduced on a massive scale. The tests currently in use 
for detection of antibodies against HTLV-III are based on one or more immunoreagents containing antigenic 
material derived from the native virus. 

An at least equally important instrument in halting the spreading of the disease will be a vaccine 
effectively protecting the groups at risk in contacting the virus against infection with HTLV-III. 
75 Both for immuno-diagnostic and for vaccination purposes there is a recognized need for effective 
antigenic material derived from a safe source, hence not derived directly from the native virus. 

The method of choice for the preparation of this (polypeptide) antigenic material is hence the synthetic 
route, in particular by organic chemical synthesis, or by recombinant DNA technology. 

It is the purpose of the present invention to provide an immunochemical reagent which can be put into 
20 use in the above-mentioned applications. 

This immunochemical reagent is characterized in that it comprises a combination of two or more 
synthetic polypeptide sequences selected from the group consisting of 

- a polypeptide sequence mimicking at least one antigenic determinant of the gag-antigen of HTLV-III; 

- a polypeptide sequence mimicking at least one antigenic determinant of the - glycoprotein gp120 of 
25 HTLV-III; and 

- a polypeptide sequence mimicking at least one antigenic determinant of the glycoprotein gp41 of 
HTLV-III. 

With a combination of polypeptide sequences is meant here any physical or chemical composition of 
said polypeptides, such as a single synthetic polypeptide containing said two or more polypeptide 
30 sequences, or a mixture of synthetic polypeptides each containing at least one of said polypeptide 
sequences. 

As indicated above the term synthetic here means derived from any source other than the natural one. 
Hence, it implies among others: prepared by organic chemical synthesis (in particular, for the present 
polypeptides this implies one of the well known methods of polypeptide synthesis, such as the solid phase 

35 method according to Merrifield), or prepared by expression of a recombinant polynucleotide containing a 
polynucleotide sequence coding for one or more of the present polypeptides in a host microorganism. 

The polypeptide sequences mimicking said antigenic determinants may be present as such in the 
immuochemical reagent according to the invention, they may be bonded to each other, and/or to additional 
polypeptide sequences, and/or to a suitable carrier, etc. This is largely dependent upon the intended use of 

40 the immunochemical reagent (vaccination or immuno-diagnosis). 

A vaccine according to the invention comprises the immunochemical reagent and generally also a so- 
called adjuvant, to promote the immune response of the vaccinated individual. Suitable adjuvants are for 
example Freunds' incomplete and complete, Cory nebacteri urn parvum, Bordetella pertussis, muramyl 
dipeptide, saponins, alhydrogel or mineral oils. 

45 The immunochemical reagent in the vaccine optionally may comprise one kind of carrier, to which each 
of the polypeptide sequences are bonded or a number of carriers, each differing in the nature of the 
polypeptide sequences bonded. These carriers may for example be liposomes, or micelles of e.g. saponins 
or the like, or proteins such as key hole limpet hemocyanin or (bovine) serum albumin, or other viruses or 
microorganisms on whose surface the polypeptide sequences according to the invention are exposed (the 

so latter also often named vector-vaccines). 

For use in an immunochemical assay method the immunochemical reagent may either be applied free 
in solution; or may comprise a solid entity or a labelling entity. Said solid entity may, for example, be a 
microtitration plate, or a tube, a stick, a rod, a filter, a bead or a solid particle, such as a latex particle, an 
erythrocyte or a particle of a sol of a dye, a metal or a metal compound. As a labelling entity use can be 

55 made, for example, of a radio-active atom or compound, a fluorescent compound, a phosphorescent 
compound, an enzyme, or a small solid particle such as a particle of a solf of a dye, a metal, or a metal 
compound. For the purpose of the present invention an antigenic determinant is considered the minimum 
structural element of an antigen that enables interaction with antibodies that also recognize the native 
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antigen. For the antigenic determinant as an isolated entry this implies that it should permanently or 
transiently present the same spatial arrangement which the same element possesses in the native antigen. 

Such a polypeptide antigenic determinant comprises generally at least about 4 or 5 amino acids. In a 
linear antigenic determinant these at least 4 or 5 amino acids are arranged as a relatively short polypeptide 
5 chain, whereas in a conformational antigenic determinant the at least 4 or 5 amino acids forming the 
determinant generally are distributed scattered over a considerable length of a polypeptide chain that folds 
and bends in such a way that the relevant amino acids are placed in each other's proximity. Both these 
types of antigenic determinants form part of the present invention. 

The polypeptide sequences mimicking the antigenic determinants of the gag-antigen, the glycoprotein 
w gp120 and the glycoprotein gp41, respectively should be substantially free oFthe naturally occurring gag, 
gp120 and gp41 proteins. Even though, the immunochemical reagent according to the invention optionally 
also may contain one or more of the naturally occurring HTLV-III proteins. 

Preferably, one or more of the polypeptides may comprise only a relevant portion of the respective 
proteins. In particular, the antigenic determinants of the glycoproteins gp120 and gp41 may according to the 
75 present invention suitably be mimicked by a portion of these proteins comprising at least about 4 to about 
160 amino acids. ~ ' 

These portions can be selected by any of the methods known in the art for the selection of antigenic 
determinants polypeptide sequences, or by employing the method to be described in the examples to 
follow. 

20 After having selected the relevant portions of polypeptides advantageously by the method to be 
described below, these selected polypeptides can be synthetically prepared, as already explained by 
organic chemical coupling of the amino acids or by employing well-known recombinant DNA technology. In 
either of the two cases the at least two polypeptides according to the invention may be prepared as isolated 
polypeptides or as fused polypeptides. These fused polypeptides either may comprise at least two of the 
25 two or more synthetic polypeptides in one sequence, or may comprise a non-related polypeptide sequence, 
or a combination of both. A suitable non-related polypeptide sequence may, for example, be an enzyme; 
which in an enzyme-immuno-assay may serve as the label. 

In the investigation underlying the present invention use was made of a method for the systematic 
study of viral gene antigenicity by expressing parts of those genes as fusion proteins in transformed cells. 
30 The various fusion proteins were tested using various methods (Western blot, Elisa) to assess their 
reactivity with respect to various antisera (anti-p24, anti-p41 and anti-p120). Although the following examples 
used transformed E.coli to illustrate the invention, any viable transformed cell line that effectively expresses 
the subject fusion protein could be used according to techniques known to those of ordinary skill in the art. 
Although our method has some similarities to those disclosed in the previously cited HTLV-III cloning 
35 studies, none of these publications allows a systematic picture of antigenic determinants to be assembled. 
The previously published data using the recombinant antigens made in E.coli demonstrate immunoreactivity 
with sera from patients with AIDS. However, unlike the study underlying the present invention, no 
conclusions could be drawn about regions of the env-gene not represented in the cited clones. 

The starting point for this study was a new KTLV-III isolate named HAT-3. The DNA of this virus was 
40 cloned into a phage-derived vector as outlined in the specific examples to follow. 

In Figure 1 restriction sites for this XHAT-3 prophage are indicated. Bars A, B and C refer to segments 
j of the gag and env genes cloned in the examples. The numbers refer to the sizes of these DNA fragment in 
kilobases. Sequence information for these portions of XHAT-3 is given in Figures 2 and 3. 

In Figure 2 the DNA sequence and the polypeptide sequence derived therefrom of fragment A (see 
45 Figure 1 ) from gag are presented. 

The DNA sequence of the entire p24 structural gene is given. The extent of DNA in fragment A is 
indicated by the regions between the two Pstl (CTGCAG) restriction enzyme sites. Figure 3 reports the DNA 
sequence and the derived polypeptide se"quence of fragments B and C (see Figure 1) from env. The DNA 
sequence of the entire env gene is given. Fragment B spans the DNA between the two indicated Pstl sites, 
so Fragment C starts at the (second) indicated Kpnl site and terminates at the same Pstl site as doesTragment 
B. Underlined sites include Pstl = CTGCAG, Kpnl = GGTACC, and Bglll = AGAfCT. 
Although similar in size and length to other env-genes that have~~been sequenced, we undertook our 
systematic study using this gene. 

As a result of the study described herein we were able to identify several immuno-dominant sections of 
55 the protein coded by the env-gene. It was found that immunochemical reagents according to the invention 
advantageously could comprise peptides corresponding with or mimicking said immuno-dominant sections. 

A suitable immuno-dominant section was identified to be comprised within the amino acid sequence 
extending between amino acids No. 30 and 181 inclusive of the env open reading frame of HAT-3 (bar A at 
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the bottom of Figure 7), and that corresponds for example with amino acids No. 38 to 188 Inclusive of 
HTLV-III (Ratner et al., Nature 313, 277-284; 1985); with amino acids No. 38-187 inclusive of ARV-2 
(Sanchez-Pescador et al., Science 227, 484-492; 1985), and with amino acids No. 38-193 inclusive of LAV 
(Wain-Hobson et al., Cell 40, 9-17; 1985). These amino acid sequences are depicted in Figure 8. 

5 Another suitable immunodominant section was identified to be comprised within the amino acid 
sequence extending between amino acids No. 463 and 520 inclusive and more particular between amino 
acids No. 483 and 520 inclusive of HAT-3 (bar B and B 1 respectively at the bottom of Figure 7), that 
corresponds for example with amino acids No. 463-519 (more particularly No. 482-519) of HTLV-III, with 
amino acids No. 461-518 (more particularly No. 481-518) of ARV-2, and with amino acids No. 468-524 

w (more particularly No. 487-524 of LAV) (Figures 9A and 9B, respectively). 

A further immuno-dominant section was identified to be comprised within the amino acid sequence 
extending between amino acids No. 473-642 of the env open reading frame of HAT-3 (bar C at the bottom 
of Figure 7), that corresponds for example with amincTacids No. 571-640 of the env open reading frame of 
HTLV-III, with the amino acids No. 571-640 of the env open reading frame of ARV-2, and with the amino 

is acids No. 576-645 of the env open reading frame~~of LAV. These four corresponding sequences are 
. represented in Figure 10. 

A still further immuno-dominant section was identified to be comprised within the amino acid sequence 
extending between amino acids No. 643-692 and more particularly between amino acids No. 643-683 of the 
env open reading frame of HAT-3 (bar D and D\ respectively, at the bottom of Figure 7), that corresponds 

20 for example with amino acids No. 641-690 (more in particular No. 641-681) of the env open reading frame 
of HTLV-III, with the amino acids No. 641-690 (more in particular No. 641-681) oTlhe env open reading 
frame of ARV-2, and with the amino acids No. 646-695 (more in particular No. 646-686)~of the env open 
reading frame of LAV. These four corresponding sequences are represented in Figures 11A and 11B, 
respectively. 

25 Still another immuno-dominant section was identified to be comprised within the amino acid sequence 
extending between amino acids No. 693-739 and more particularly between amino acids No. 716-739 of the 
env open reading frame of HAT-3 (bar E and E\ respectively, at the bottom of Figure 7), that corresponds 
for example with amino acids No. 691-737 (more in particular No. 714-737) of the env open reading frame 
of HTLV-III, with the amino acids No. 691-737 (more in particular No. 714-737) dTthe env open reading 

30 frame of ARV-2, and with the amino acids No. 696-742 (more in particular No. 719-742)~of the env open 
reading frame of LAV. These four corresponding sequences are represented in Figures 12A and 12B, 
respectively. 

Plasmids p24-10SB.3, pENVKPN45 and pGAG8, and HTLV-III isolate HAT-3 virus were deposited in the 
American Type Culture Collection, 12301 Parklawn Drive, Rockville, Maryland 20852 on December 17th, 
35 1985, prior to filing this patent application, and assigned ATCC Deposit Nos. 53375, 53374, 53373 and 
40213, respectively. 

Plasmids p2-10SB.5, p28-23EB.4, p28-EB.1 and p45ESI.3 were deposited in the American Type Culture 
Collection on July 18, 1986, prior to filing this patent application, and assigned ATCC Deposit Nos. 67159, 
67157, 67158 and 67160, respectively. 

40 

Example 1 
Host strain 

45 DH5 A(lac )U169 . This E.coli strain is a derivative of DH1 (ATCC, Rockville, Maryland), which is lac + - 
(Hanahan, D., J. Mol. Biol. (1 983) 166: 557). 

Standard E.coli genetic techniques were used to introduce the A(lac)U1 69 deletion. This deletion com- 
pletely removes the genes of the lac operon and any homology with the plasmids. The parent strain DH5 is 
a spontaneous isolate of DH1 that has a 10-fold higher transformation frequency compared to DHL The 
50 complete genotype of this strain is: DH5, F~ A (argF - lac) U 169 endA1 hsdR17 (r k ~, m k + ) supE44 thi-1 X - 
reCA1 gyrA96 relAI . 

Example 2 

55 Cloning vector 

The cloning vectors used in these experiments are derivatives of pBR322 (ATCC, Rockville, Maryland). 
All HTLV-III clones are derived from plasmid pMLB1113.IGI. This plasmid carries a portion of E.coli lac 
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operon cloned between the EcoRI site (nucleotide 4361) and the Aval site (nucleotide 1425) of pBR322. The 
lac are oriented in a clockwise direction in the standard pBR322 representation. The lacZ gene fragment 
originated from plasmid pMLB1034 (Berman et ah, U.S. Patent 4,503,142). The lad gene fragment 
originated from plasmid pMC9 (Calos, Nature , 274 , 762-765; 1978). These genes "are cloned from a 

5 naturally occurring Hindi site preceding the lad gene to a naturally occurring Aval site at codon 70 in lacY. 
This corresponds to nucleotides (n.t.) 13-466~7of the pMLB1113.IGI sequence. (The annotated sequence of 
the lac operon has been assembled in Appendix M, pages 273-281 , of Silhavy et aL, supra.) 

In this vector, several changes have been introduced into the lac genes via standard genetic 
engineering techniques. There is a point mutation in the promoter foMacI (W.T. C -* T; n.t. 28 of 

io pMLB1113.IG1) that results in 10X higher levels of repressor synthesis ( LacT q1 ). At codon 5 in the lacZ 
gene, the multiple cloning sites from plasmid PUC8 (Vieira and Messing, Gene, 19: 259-268, 1982) have 
been incorporated (n.t. 1310-1345 of pMLB1113.IG1). The naturally occurring EcoRT site at codon 1006 of 
lacZ is removed by a point mutation (W.T. C — A; n.t. 4352 of pMLB1113.IG1. 

In addition to alterations to the lac genes, a single strand 1 phage origin has been incorporated into the 

75 pBR322 sequence. A 514 base paTT fragment from bacteriophage M13 (ATCC, Rockville, Maryland) has 
been cloned between two adjacent Ahalll sites distal to the bla gene (PBR322 n.t. 3232 to n.t. 3251 
replaced with M13 n.t. 5488 to n.t. 6001; n.t. 6472 to n.t. 6985 ofpMLB1113.IGI). This M13 region functions 
to generate single-stranded DNA from the pBR322 derivative in the presence of a helper M13 phase 
(Zagursky and Berman, Gene (1984) 27: 183-191). These functions have no effect on normal pBR322 

20 replication or maintenance and are only active during M13 phage infection. 

The parenteral cloning vector, pMLB1113.IGI, confers ampicillin resistance and highly regulated synthe- 
sis of native £-galactosidase to the host cell DH5 A(lac). A Lac~ derivative of this vector was used for 
several of the actual cloning experiments. This derivative was altered at the BamHI site (n.t. 1320) by 
cleavage and fill-in with DNA polymerase. This plasmid is designated pMLB1 120.IGI. All the alterations to 

25 these cloning vectors have been verified by DNA sequence analysis of the recombinant regions. 

The two cloning vectors (pMLB1113.IGI and pMLB1120.IGI) used in this study are diagrammed in 
Figure 4. Solid boxes represent the lac genes of E.coli. The M13 intergenic region is indicated by the 
striped box. The gene encoding ampicillin resistance (bla) and the origin of replication (ori) from pBR322 is 
indicated. A frameshift mutation ( + 1) was introduced atlhe multiple cloning site as indicated. This mutation 

30 was confirmed by determining the DNA sequence at the cloning site. 

Example 3 
HTLV-III DNA 

35 

A genomic clone (XHAT-3) of an HTLV-III prophage in bacteriophage was used. This particular clone 
carries an approximately 9Kb HTLV-III DNA fragment from two naturally occurring Sstl sites in the viral 
LTRs. A simplified restriction enzyme site map of this clone is shown in Figure TTTln published DNA 
sequence information from the gag and env gene regions is included here (Figures 2 and 3). This allowed 

40 us to obtain certain restriction enzyme fragments from the original clone and various subclones constructed 
in our lab (Figure 5). A partial map is shown on the top line of Figure 5. The sizes of the EcoRI fragments in 
this region of the clone are indicated. These fragments were subcloned into plasmid vector pMLB1113.IG1. 
Clone 29 carries the 8.3 Kbp EcoRI fragment from the phage. Clone 26 carries the 3.0 Kbp and 1.1 Kbp 
EcoRI fragments. Digestion of these subclones with the appropriate enzymes (as indicated) yielded the 

45 three starting plasmids enclosed in boxes. 

Example 4 

Plasmid pGAG8 (see Figure 5) 

50 ' — 

This clone is derived from plasmid pMLBH20.IGl by the insertion of a 506 base pair Pst1 fragment 
from the p24 coding region of HTLV-III (see Figure 2). This DNA encodes a 169 amino acid polypeptide 
internal to the p24 portion of gag, which represents approximately 70% of the p24 gene product. 

Following induction of the~bperon with 10~ 3 M isopropylthiogalactoside (IPTG), the 134K MW hybrid 
55 protein was produced in an amount of approximately 10-15% of total cellular protein. An equal amount of 
native size 0-galactosidase monomer was also synthesized. In ceil sonicates centrifuged at a low speed 
(-10,000 x g), the hybrid Gag-LacZ protein could be isolated in insoluble form in the pellet. 
This protein was resolubilized in 7M Guanidine-HCI and maintained in solution of 0.05M Guanidine-HCI. 
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Example 5 
Plasmid pENVPSTI 

5 This clone is derived from plasmid pMLB1 120.IG1 by the insertion of a 2339 base pair Pstl fragment 
from the env gene of HTLV-III (see Figure 3). This fragment represents 94% of the viral codingTegion. (The 
coding sequences for gp41 are also carried on this fragment, see above.) 

This clone is the poorest producer of HTLV-III hybrid proteins. Following induction, it produces about 
1% Env-LacZ hybrids. The full size fusion protein (206K MW) is the least abundant, but reacts strongly with 

w positive patient's serum. A major hybrid protein appears at the 149K MW immuno-reactive band. Two 
additional /9-galactosidase hybrids, synthesized in variable amounts, appear at 133K and 124K MW 
positions. Finally, a substantial band is present at the position of /3-galactosidase. All four size classes of 
hybrid proteins reacts with anti-j8-galactosidase antiserum, while the two highest molecular weight species 
react with HTLV-III positive patient serum. 

75 

Example 6 

Plasmid pENVKPN24 and Related Plasmids 

20 This clone is derived from an internal Kpnl - Pstl fragment cloned into phage M13mp18 Gene 33, 103- 
109; 1985and subsequently subcloned (using EcoRI - Hindlll) into pMLB1 1 13.IGI. The resulting" clone 
(pENVKPN) is LacZ" because the translational frame of env and lacZ are not the same. Various 
spontaneous mutants, called "pop-ups", were isolated using a published protocol (Berman and Jackson, J. 
Bacteriol. , 159 : 750-756, 1984). One of these, pENVKPN24, synthesized about 5% levels of the Env-LacZ 

25 hybrid. This clone differed from the pENVPSTI clone in that 152 codons have been removed from between 
the 5' Pstl and Kpnl sites (deletion of 457 n.t.). DNA sequence analysis of this clone revealed a deletion of 
an A residue located at n.t. 3225 of pENVKPN24 that re-aligns the env coding region with the lacZ coding 
segment. Unlike the pENVPSTI clone, pENVKPN24 produces mostlylull size hybrids (-20QK ~MW) and a 
second, less abundant protein at -149K MW. This plasmid was converted to LacZ~ by enzymatically 

30 introducing a frameshift mutation at the Hind lll site adjacent to lacZ. Once again, various LacZ + derivatives 
were selected. (See Figure 6). ~~ 

A second Lac + derivative of plasmid pENVKPN selected the same way is plasmid pENVKPN45. This 
plasmid synthesized about 10-15% levels of Env-LacZ hybrid. In this clone, DNA sequence analysis 
revealed a deletion of 1024 n.t 

35 A third Lac + derivative of plasmid pENVKPN from this selection is plasmid pENVKPN28. This plasmid 
synthesized high levels of a 200K M.W. hybrid at levels similar to pENVKPN24. The same method used for 
pENVKPN24, was used to engineer a LacZ" derivative of pENVKPN28. Various Lac + mutants had suffered 
deletions of env genetic material. DNA sequence analysis of these derivatives showed that all Lac + mutants 
had suffered deletions of env genetic material. The extent of these deletions is summarized in Figures 6 

40 and 7. 

A fourth Lac + derivative p45ESI.3 was constructed by exchanging the EcoRI - StuI 587 bp fragment from 
clone pENVPSTI for the 1 18 bp EcoRI - StuI fragment of pENVKPN45. ~~ 

Example 7 

45 

High level Production of Recombinant Antigens Corresponding to (g)p120, (g)p41 and p24 Viral Gene 
Products 

Several in vitro deletions were introduced enzymatically into the LacZ + clones in order to obtain high 
so levels (1-15% of total cellular protein) of recombinant hybrids. The level of protein production from plasmid 
pGAG8 was sufficient as constructed. This plasmid produces the equivalent of recombinant p24 from our 
particular viral isolate. 

Plasmid pENVKPN45 produces the equivalent of (g)p120 as a /S-galactosidase hybrid at a 10-15% 
level. Plasmids pENVKPNH3 28-45 and p45ESI.3 are additional sources of the recombinant g(p120)hybrid 
55 proteins. The pENVKPNH3 28-45 and p45ESI.3 clones express 0-galactosidase hybrid proteins at 5-10% of 
total cellular protein. 

Plasmid pENVPSTH3 2-10 and pENVKPNH3 24-10 were subjected to restriction nuclease digestion and 
religation. (See Figure 6.) 
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The top of Figure 7 represents a map of the env gene of HTLV-III. The three striped areas of 
hydrophobic regions corresponding to the signal sequence (far left), a region at the N-terminus of the p41 
precursor, and a transmembrane anchor segment (far right). Vertical lines at codon positions 30, 144 and 
810 correspond to the cloning sites Pstl, Kpn l and Pstl, represented as nucleotide positions 112, 544, 2454, 
5 respectively in Figure 3. The site of proteolytic cleavage that yields p120 (left-most) and p41 is indicated by 
the V symbol. 

The right-most column indicates the number of the clone (refer to Figure 6). The open bars indicate the 
extent of DNA from env in the individual clones. The numbers inside of these bars refer to, the first codon in 
env and the last codon in env (using the pENVPSTI sequence as a reference). These details were 

70 determined using DNA sequence information. 

The four bars p28EB1.1, p28-23EB4.1, p2-10SB.5 and p24-10SB.3 represent "DNA" encoding four 
additional gene products from pENVPSTI (no. 1). Internal initiation of protein synthesis begins at the 
naturally occurring methionine codon 483, represented by the dotted vertical lines. This is represented by 
(M) on the Figure. All other bars represent precise alterations to the DNA sequence resulting in the 

75 production of hybrid proteins of the indicated molecular weight. All hybrid proteins show a positive reaction 
with anti-/3-galactosidase anti-serum. Cysteine residues are represented by c's above the env map. The 
vertical dashed lines indicate the location of two Bglll sites used for genetic engineering. (SeeTigure 6). 

Example 8 

20 

Immunochemical testing of HAT-3 env-derived polypeptides 

The env-derived polypeptides produced according to Example 7 were tested with regard to their 
immunochemical properties both using ELISA and Western blot. For these tests use was made of 
25 solubilized material from the harvested cells. 

A. Solubilization of env-derived recombinant polypeptides 

Harvested cells are suspended in a buffer containing 50 mmol/l of Tris, 0.1 mmol/I of PMSF, pH 7.8, in 
30 a ratio of 1 gram of cells: 4 ml of the buffer. Cells are lysed using a French pressure cell at an internal 
working pressure of 20,000 PSI. The lysed cell suspension is evacuated from the pressure cell at a flow rate 
of 1 5 drops/minute into a chilled vessel and centrifuged at 48,000 xg for 30 minutes at 4 * C. The resulting 
pellet material (called (Pi) is washed with a solution containing 50 mmol/l of Tris and 0.1 mmol/l of PMSF 
and centrifuged as described above. The passed . Pi pellet material is then homogeneously suspended, 
35 using a spatula or rubber policeman, into a solution containing 8 mol/l urea, 50 mmol/l of Tris, 10 mmol/l of 
EDTA, 1 mmol/l of DH, 0.1 mmol/l of PMSF with a pH of 8.1; the ratio of grams of initial cells used for Pi: 
ml of solubilizing buffer is 1:4. This suspension is sonicated using a sonicating water bath at room 
temperature for 45 minutes. The solution is centrifuged at 200,000 xg for 45 minutes at 4 * C. The resulting 
supernatant, called S2, may be dialyzed against PBS or against a solution containing 50 mmol/l of Tris, 10 
40 mmol/l of EDTA, 1 mmol/l of DH and 0.1 mmol/l of PMSF, of pH 8.1 to removed urea, or may be left in the 
urea buffer. The thus prepared fraction S2 was used for the ELISA and Western blot analysis. 

B. ELISA 

45 The ELISA is performed according to the following steps: 

1. The wells of a microtitration plate are coated with 100 n\ of antigen obtained by dialysing the S2 
fraction obtained according to A. against 0.05 mmol/l sodium carbonate/bicarbonate buffer, pH 9.6. The 
coating is performed at 4 °C with 0.5-10 ug/ml, during 1-3 days, depending upon the antigen 
preparation. 

50 2. The antigen is aspirated, and the wells are blocked with Blotto (5% non fat dry milk in dhfeO). This is 
coated for 45 minutes at room temperature. 

3. While the plates are being blocked the anti-sera containing antibodies specific against HTLV-III 
antigens which have been diluted 1:100 with Blotto + 10% goat serum/PBS + 1% Triton® X-100 are 
incubated for 30 minutes at 37 * C with a S2 preparation of the E.coli which does not contain a vector 

55 coding for HTLV-III antigens. 

4. Blotto is aspirated from the wells, 100 ul of the absorbed diluted antiserum is added and the 
microtitration plate is incubated for 90 minutes in a heating block at 37 • C. 

5. The serum is aspirated and the wells are washed thrice with 1% glycerol + 0.05% Tween® 20. 
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6. Subsequently 100 ul of a conjugate solution containing 0.143 ug goat-anti -human lgG(H + L)/HRP 
(Kirkegaard and Perry Laboratories, Gaithersburg, MD) per ml of 10% goat serum/PBS + 10% Triton® 
X-100 are added to each well, and incubated for 60 minutes in a heating block at 37 * C. 

7. The conjugate is aspirated and the wells are washed six times. 

8. 100 ul of TMB is added to each of the wells of the microtitration plate and allowed to react during 30 
minutes at room temperature. Thereafter the reaction is stopped by the addition of 100 ul of a 4N hfeSO* 
solution and the absorbance is read at 450 nm. 

C. Western blot analysis 

For this analysis the HTLV-III antigens of S 2 are first separated by SDS-PAGE using a method 
according to Laemmli (Nature 227, 680-5; 1970), and the thus separated antigens are subsequently 
transferred to nitrocellulose sheetsTaccording to the method described by Towbin et al. (Proc.Natl.Acad.ScL 
76, 4350-4). 

D. Reactivity of Hybrid Proteins 

Hybrid proteins expressed by plasmids listed in the first column of Table I were coated to microtitration 
plates and showed the reactivity indicated in Table I with Serum 1 and Serum 2 containing antibodies to 
gp41 and both gp41 and gp120 proteins, respectively, in the test system described under B. These results 
confirm the reactivity of peptide sequences indicated by the bars A, B, B 1 and C in Figure 7. The higher 
molecular weight hybrid proteins illustrate protein synthesis beginning with the lac operon (e.g., 206 M.W. 
hybrid) and the lower molecular weight hybrids are proteins for which synthesis is initiated with the naturally 
occurring methionine codon 483. 



TABLE I 



30 



Plaamid 



M.W. 
hybrid 



Serum 1 
<*p41+) 



Serum 2 
(gp41+ and 
gpl20+> 



35 



40 



45 



50 



pENVPST 1 


206-149K 


♦ 


+ 


pENVPSTH3 1-2 


200-149K 




+ 


pENVPSTH3 2-10 


20O-14OK 




+ 


p45ES1.3 


185K 






pENVKPN 24 


200-150K 


+ 


+ 


pENVKPN 28 


200-149K 


+ 


+ 


pENVKPNH3 28-23 


190-145K 


♦ 


+ 


pENVKPNH3 24-10 


180-1 33K 


+ 


+ 


pENVKPNH3 28-24 


155-1 18K 






p£NVKPNH3 28-45 


152-117K 




+ 


pENVKPN 45 


150K 







55 E. The use of a combination of recombinant proteins in a screening assay for antibodies against HTLV-HI 

a. A number of sera from individuals at risk of contracting an infection with HTLV-III was tested in an 
ELISA, using the procedure described under B. Plates were coated with HTLV-III recombinant proteins 
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(S2 fractions prepared as described under A.). Plates a. were coated with proteins (2.5 ug/ml) from clone 
24-10SB.3, plates b. were coated with proteins (1.5 ug/ml) from clone gag-8 and plates c. were coated 
with a mixture of the two proteins (2.5 ug/ml 24-10SB.3 plus 1.5 ug/ml gag-8). 

The results were compared with data obtained by the Western blot (WB) technique described under C. 
5 and a commercial ELISA (Vironostika), both using natural viral lysate as the source of antigen (Table II). 



TABLE II 

10 





Plates: 




a 


b 




c 






VIRONOS 




antigen! 


24- 


10SB3 




-8 


combination 


WB 


TIKA 


15 


sample 




















1 


.24 




.27 




.61 


+ 


41 


1.04 + 


20 


2 


.19 




1.26 


+ 


1.32 


+ 


24,41 


0.30 + 




3 


.46 


+ 


.31 




.88 




41 


2.65 ♦ 




4 


1.79 


+ 


.41 


+ 


2.51 


+ 


41,24 


>3.00 + 




5 


1.41 




.74 


♦ 


2.50 


+ • 


41,24 


>3.00 + 


25 


6 


.33 




.53 




.80 




24 


.12 - 




7 


.22 




.36 




.55 




neg. 


.20 - 




Low positive 


















30 


control 


. 39 


+ 


.62 




1.52 


+ 


41,24 


1.31 + 




NHS 


.14 




.29 




.40 




neg. 


.12 - 




Cut off 




















value 


.24 




.39 




.60 






.68 



35 



The above data show that a test system wherein a combination of two antigenic polypeptides according 
to the invention are employed performs more reliably than a test system with a single type of antigenic 
40 polypeptide. 

b. Plates d. were coated with a combination of S2 fractions of three recombinant proteins (clones gag-8 
(1.5 ug/ml), 24-10SB.3 (2.5 ug/ml) and p45 ESL3 (2.5 ug/ml) by methods described above. A number of 
sera, previously analyzed for anti-HTLV-lll antibodies by Western blot and ELISA (Vironostika) tech- 
niques, using natural viral lysate as the source of antigen, was treated on these plates, as well as on the 
45 plates with gag-8 and 24-10SB.3 antigen (plate c. described. 

Table III 

50 

Plates: 
coated with: 
Samples (n) 

55 

Western blot positive 100 



d c VIRONOSTIKA 

3 antigens 2 antigens viral lysate 

99 pos. 89 pos. 95 pos. 
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It thus appears that a test system wherein the combination of three antigenic polypeptides employed 
scores significantly more of the samples that were found to be positive by Western blot analysis. 

F. The effect of the length of the polypeptide chain of the p41 recombinant protein 

5 ' ~~ ~~ — 

Plates were coated with S2 preparations of the following clones: plates e. with 2-10 SB.5 (2.5 ug/ml), 
plates f. with 28 EB1.1 (2.5 ug/ml) and plates g. with 28.23 EB4.1 (2.5 ug/ml)7Sera were tested using these 
plates by the methods described above. The results (Table IV) are compared with those obtained on plates 
a. (coated with clone 24-10 SB3) and with Western blot. 

10 

Table IV 



75 



25 



30 



Plate: 


a 








f 








We a tern 


antigen: 


24-10 


SB. 3 


2-10 


SB.5 


28 EB1.1 


28-23 EB4 


.1 


blot on 




















viral 


Samples 


















lyeate 


11 


0.23 




2.30 


+ 


2.83 


+ 


>3.00 


* 


41,24 


12 


0.7 




.27 




.34 




.48 




41 weak 


13 


.19 




1.05 


+ 


.78 


+ 


.94 


+ 


41,24 


14 


.15 




.85 




1.47 


+ 


1.95 




41.24 


15 


.24 


+ 


1.20 


+ 


1.44 




1.93 


+ 


41 


16 


.46 




>3 


+ 


>3 


+ 


>3 




41 


17 


.24 


+ 


1.42 




1.68 


+ 


2.00 




41 


NHS 


.14 




.82 




.68 




.63 






Cut off 




















value 


.24 




.92 




.78 




.73 







These data indicate that some of the sera which are unreactive with plate a. show positive reaction with 
plates e.,f. and g. From this result it is concluded that the extra polypeptide Iragment in clone 2-10 SB.5 
(bar D in Figure 7) is responsible for the binding of an extra class of antibodies, and hence that this 
40 polypeptide fragment comprises another antigenic determinant. 

Another difference is detected with sample 14, which was negative both with plate a. and e., and 
significantly positive with both plates f. and g. This result suggests the presence of another antigenic 
determinant comprised within the extra polypeptide fragment in clone 28 EB1.1 as compared to clone 2-10 
(bar E in Figure 7). 

45 Furthermore it is tempting to assume that the transmembrane segment of gp4l (the shaded area in the 
uppermost bar of Figure 7 between amino acids No. 684 and 715) will not be involved in the binding of 
antibodies in vivo. From this assumption it is inferred that the antigenic determinants of D and E are 
comprised in the segments indicated by bars D 1 and E\ respectively. 

50 Claims 

1. Immunochemical reagent characterized in that it comprises a combination of two or more synthetic 
polypeptide sequences selected from: 

- a polypeptide sequence containing at least one antigenic determinant of the gag-antigen of HTLV- 
55 III isolate XHAT-3; 

- a polypeptide sequence containing at least one antigenic determinant of the glycoprotein gp120 
of HTLV-III isolate XHAT-3; and 
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- a polypeptide sequence containing at least one antigenic determinant of the glycoprotein gp41 of 
HTLV-III isolate XHAT-3, said isolate comprising genomic DNA from HTLV-III isolate HAT-3 virus 
deposited with the ATCC under No. 40213. 

5 2. Immunochemical reagent according to claim 1, characterized in that it comprises a polypeptide 
corresponding to the carboxyterminal about 58 amino acids long polypeptide of the gp120 protein. 

3. Immunochemical reagent according to claim 2, characterized in that it comprises a polypeptide having 
comparable immunochemical properties as a polypeptide of the sequences represented in figure 9A. 

10 

4. Immunochemical reagent according to claim 1, characterized in that it comprises a polypeptide 
corresponding to the carboxyterminal about 38 amino acids long polypeptide of the gp120 protein. 

5. Immunochemical reagent according to claim 4, characterized in that it comprises a polypeptide having 
75 comparable immunochemical properties as any of the polypeptides of the sequences represented in 

figure 9B. 

6. Immunochemical reagent according to any one of claims 1 to 5, characterized in that it comprises a 
polypeptide having comparable immunochemical properties as the partial amino acid sequence 

20 extending between amino acid no. 53 and amino acid no. 122 of the gp41 protein. 

7. Immunochemical reagent according to claim 6, characterized in that it comprises a polypeptide having 
comparable immunochemical properties as a polypeptide of the sequences represented in figure 10, 
corresponding to amino acids 571-640 of the HTLV-III sequence. 

25 

8. Immunochemical reagent according to any one of claims 1 to 7, characterized in that it comprises a 
polypeptide having comparable immunochemical properties as the partial amino acid sequence 
extending between amino acid no. 123 and amino acid no. 172 of the gp41 protein. 

30 9. Immunochemical reagent according to claim 8, characterized in that it comprises a polypeptide having 
comparable immunochemical properties as a polypeptide of the sequences represented in figure 11 A, 
corresponding to amino acids 641-690 of the HTLV-III sequence. 

10. Immunochemical reagent according to any one of claims 1 to 7, characterized in that it comprises a 
35 polypeptide having comparable immunochemical properties as the partial amino acid sequence 

extending between amino acid no. 123 and amino acid no. 163 of the gp41 protein. 

11. Immunochemical reagent according to claim 10, characterized in that it comprises a polypeptide having 
comparable immunochemical properties as a polypeptide of the sequences represented in figure 1 1 B, 

40 corresponding to amino acids 641-681 of the HTLV-III sequence. 

12. Immunochemical reagent according to any one of claims 1 to 11, characterized in that it comprises a 
polypeptide having comparable immunochemical properties as the partial amino acid sequences 
extending between amino acid no. 173 and amino acid no. 217 of the gp41 protein. 

45 

13. Immunochemical reagent according to claim 12, characterized in that it comprises a polypeptide having 
comparable immunochemical properties as a polypeptide of the sequences represented in figure 12A, 
corresponding to amino acids 691-737 of the HTLV-III sequence. 

so 14. Immunochemical reagent according to any one of claims 1 to 11, characterized in that it comprises a 
polypeptide having comparable immunochemical properties as the partial amino acid sequence 
extending between amino acid no. 196 and amino acid no. 217 of the gp41 protein. 

15. Immunochemical reagent according to claim 14, characterized in that it comprises a polypeptide having 
55 comparable immunochemical properties as a polypeptide of the sequences represented in figure 12B, 
corresponding to amino acids 714-737 of the HTLV-III sequence. 
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16. Immunochemical reagent according to any one of claims 1 to 15, characterized in that it comprises a 
polypeptide having comparable immunochemical properties as the partial amino acid sequence 
extending between amino acid no. 30 and amino acid no. 181 of the gp120 protein. 

s 17. Immunochemical reagent according to claim 16, characterized in that it comprises a polypeptide having 
comparable immunochemical properties as a polypeptide of the sequences represented in figure 8. 

18. Synthetic polypeptide comprising at least one amino acid sequence having the immunochemical 
properties of the amino acid sequence of HTLV-III isolate XHAT-3, said isolate comprising genomic 

70 DNA from HTLV-III isolate HAT-3 virus deposited with the ATCC under No. 40213 which polypeptide is 
selected from: 

- the carboxyterminal 38 amino acids long polypeptide of the gp120 protein; 

- the partial amino acid sequence extending between amino acid no. 53 and amino acid no. 122 of 
the gp41 protein; 

75 - the partial amino acid sequence extending between amino acid no. 123 and amino acid no. 163 of 

the gp41 protein; 

- the partial amino acid sequence extending between amino acid no. 196 and amino acid no. 217 of 
the gp41 protein; and 

- the partial amino acid sequence extending between amino acid no. 30 and amino acid 1 81 of the 
20 gp1 20 protein. 

19. Synthetic polypeptide according to claim 18 comprising at least one of the amino acid sequences 
represented in the figures 8, 9A, 9B, 10, 11 A, 11B, 12A and 12B, corresponding to a part of the XHAT-3 
sequence. 

25 

20. A recombinant DNA characterized in that it comprises a DNA sequence coding for the production of a 
polypeptide according to claim 18 or 19. 

21. A plasmid characterized in that it comprises a recombinant DNA according to claim 20. 

22. A host cell characterized in that it is transformed by the plasmid according to claim 21 . 

23. The HTLV-III isolate XHAT-3, comprising genomic DNA from HTLV-III isolate HAT-3 virus deposited 
with the ATCC under No. 40213. 

24. A plasmid according to claim 21, characterized in that it is derived from the HTLV-III isolate of claim 23. 

25. The plasmid of claim 24, selected from the group consisting of p24-10SB.3, pENVKPN45, pGAG8, p2- 
1 0SB.5, p28-23EB.4, and p28-EB.1 . 

26. The plasmid of claim 24, selected from the group consisting of pENVPSTI , pENVPSTH3, pENVPSTH3 
1-2, pENVPSTN3 2-10, p2-10 EB.3, pENVKPN, pENVKPN24, pENVKPNH3 24, pENVKPNH3 24-10, 
pENVKPN 28, pENVKPNH3 28, pENVKPNH3 28-23, pENVKPNH3 28-24, and pENVKPNH3 28-45. 

45 27. A method for preparing the plasmid of claim 21, comprising: 

(a) inserting a cDNA sequence from HTLV-III into a plasmid for transforming a cellular host; 

(b) deleting from the plasmid of step (a) a portion of an identifiable, expressable sequence to effect 
an out-of-phase frameshift for that sequence; 

(c) transforming a cellular host with the plasmid of step (b); 
so (d) culturing the transformed host; 

(e) selecting the host cells showing spontaneous mutations that effect an in-phase frameshift for the 
identifiable sequence of step (b); 

(f) culturing the host cell selected in step (e); 

(g) selecting the host cell that produce useful amounts of polypeptides that are effectively im- 
55 munoreactive with HTLV-III antibodies; and 

(h) isolating and identifying plasmids that code for the polypeptides of step (g). 



30 
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28. A polypeptide characterized in that it is expressed by host cell transformed with a plasmid according to 
claim 25 or 26. 

29. A test kit for use in an immunoassay characterized in that it comprises an immunochemical reagent 
5 according to any one of claims 1 to 17. 

Patentanspruche 

1. Immunochemisches Reagens, dadurch gekennzeichnet, dass es eine Kombination von zwei Oder mehr 
10 synthetischen Polypeptidsequenzen enthalt, ausgewahlt aus : 

- einer Polypeptidsequenz, welche mindestens einen antigenen Determinanten des gag-Antigens 
von HTLV-lll-lsolat XHAT-3 enthalt; 

- eine Polypeptidsequenz, welche mindestens einen antigenen Determinanten des Glycoproteins 
gp120 von HTLV-lll-lsolat XHAT-3 enthalt, und 

75 - eine Polypeptidsequenz, welche mindestens einen antigenen Determinanten des Glycoproteins 

gp41 von HTLV-lll-lsolat XHAT-3 enthalt, welches Isolat genomische DNA aus HTLV-lll-lsolat 
HAT-3- Virus, hinterlegt bei der ATCC unter der No. 40213, umfasst. 

2. Immunochemisches Reagens nach Anspruch 1, dadurch gekennzeichnet, dass es ein Polypeptid 
20 entsprechend dem carboxyendstandigen, etwa 58 Aminosauren langen Polypeptid des gp1 20-Proteins 

enthalt. 

3. Immunochemisches Reagens nach Anspruch 2, dadurch gekennzeichnet, dass es ein Polypeptid 
enthalt, welches vergleichbare immunochemische Eigenschaften wie ein Polypeptid der in Figur 9A 

25 dargestellten Sequenzen aufweist. 

4. Immunochemisches Reagens nach Anspruch 1, dadurch gekennzeichnet, dass es ein Polypeptid 
entsprechend den carboxyendstandigen, etwa 38 Aminosauren langen Polypeptid des gp1 20-Proteins 
enthalt. 

30 

5. Immunochemisches Reagens nach Anspruch 4, dadurch gekennzeichnet, dass es ein Polypeptid 
enthalt, welches vergleichbare immunochemische Eigenschaften wie die Polypeptide der in Figur 9B 
dargestellten Sequenzen aufweist. 

35 6. Immunochemisches Reagens nach einem der Anspruche 1 bis 5, dadurch gekennzeichnet, dass es ein 
Polypeptid enthalt, welches vergleichbare immunochemische Eigenschaften wie die partielle Aminosau- 
resequenz aufweist, welche sich zwischen Aminosaure No. 53 und Aminosaure No. 122 des gp41- 
Proteins erstreckt. 

40 7. Immunochemisches Reagens nach Anspruch 6, dadurch gekennzeichnet, dass es ein Polypeptid 
enthalt, welches vergleichbare immunochemische Eigenschaften wie ein Polypeptid der in Figur 10 
dargestellten Sequenzen, entsprechend den Aminosauren 571-640 der HTLV-lll-Sequenz, aufweist. 

8. Immunochemisches Reagens nach einem der Anspruche 1 bis 7, dadurch gekennzeichnet, dass es ein 
45 Polypeptid enthalt, welches vergleichbare immunochemische Eigenschaften wie die partielle Aminosau- 

resequenz aufweist, welches sich zwischen Aminosaure No. 123 und Aminosaure No. 172 des gp4i- 
Proteins erstreckt. 

9. Immunochemisches Reagens nach Anspruch 8, dadurch gekennzeichnet, dass es ein Polypeptid 
50 enthalt, welches vergleichbare immunochemische Eigenschaften wie ein Polypeptid der in Figur 11 A 

dargestellten Sequenzen, entsprechend den Aminosauren 641-690 der HTLV-lll-Sequenz, aufweist. 

10. Immunochemisches Reagens nach einem der AnsprGche 1 bis 7, dadurch gekennzeichnet, dass es ein 
Polypeptid enthalt, welches vergleichbare immunochemische Eigenschaften wie die partielle Aminosau- 

55 resequenz aufweist, welche sich zwischen Aminosaure No. 123 und Aminosaure No. 162 des gp41- 
Proteins erstreckt. 
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11. Immunochemisches Reagens nach Anspruch 10, dadurch gekennzeichnet, dass es ein Polypeptid 
enthalt, welches vergleichbare immunochemische Eigenschaften wie ein Polypeptid der in Figur 1 1 B 
dargestellten Sequenzen, entsprechend den Aminosauren 641-681 der HTLV-lll-Sequenz, aufweist. 

5 12. Immunochemisches Reagens nach einem der Anspruche 1 bis 11, dadurch gekennzeichnet, dass es 
ein Polypeptid enthalt, welches vergleichbare immunochemische Eigenschaften wie die partiellen 
Aminosauresequenzen aufweist, welche sich zwischen Aminosaure No. 173 und Aminosaure No. 217 
des gp41 -Proteins erstrecken. 

70 13. Immunochemisches Reagens nach Anspruch 12, dadurch gekennzeichnet, dass es ein Polypeptid 
enthalt, welches vergleichbare immunochemische Eigenschaften wie ein Polypeptid der in Figur 12A 
dargestellten Sequenzen, entsprechend den Aminosauren 691-737 der HTLV-lll-Sequenz, aufweist. 

14. Immunochemisches Reagens nach einem der AnsprOche 1 bis 11, dadurch gekennzeichnet, dass es 
75 ein Polypeptid enthalt, welches vergleichbare immunochemische Eigenschaften wie die partielle Amino- 
sauresequenz aufweist, welche sich zwischen Aminosaure No. 196 und Aminosaure No. 217 des gp41- 
Proteins erstreckt. 

15. Immunochemisches Reagens nach Anspruch 14, dadurch gekennzeichnet, dass es ein Polypeptid 
20 enthalt, welches vergleichbare immunochemische Eigenschaften wie ein Polypeptid der in Figur 12B 

dargestellten Sequenzen, entsprechend den Aminosauren 714-737 der HTLV-lll-Sequenz, aufweist. 

16. Immunochemisches. Reagens nach einem der Anspruche 1 bis 15, dadurch gekennzeichnet, dass es 
ein Polypeptid enthalt, welches vergleichbare immunochemische Eigenschaften wie die partielle Amino- 

25 sauresequenz aufweist, welche sich zwischen Aminosaure No. 30 und Aminosaure No. 181 des gp120- 
Proteins erstreckt. 

17. Immunochemisches Reagens nach Anspruch 16, dadurch gekennzeichnet, dass es ein Polypeptid 
enthalt, welches vergleichbare immunochemische Eigenschaften wie ein Polypeptid der in Figur 8 

30 dargestellten Sequenzen aufweist. 

18. Synthetisches Polypeptid, welches mindestens eine Aminosauresequenz enthalt, welche die immuno- 
chemischen Eigenschaften der Aminosauresequenz von HTLV-lll-lsolat- XHAT-3 aufweist; welches 
Isolat die genomische DNA auf HTLV-lll-lsolat-HAT-3-Virus, hinterlegt bei der ATCC unter der No. 

35 40213 umfasst, wobei das Polypeptid ausgewMhlt ist aus: 

- dem carboxyendstandigen, 38 Aminosauren langen Polypeptid des gp120-Proteins; 

- der partiellen Aminosauresequenz, welche sich zwischen Aminosaure No. 53 und Aminosaure 
No. 122 des gp41 -Proteins erstreckt; 

- der partiellen Aminosauresequenz, welche sich zwischen Aminosaure No. 123 und Aminosaure 
40 No. 163 des gp41 -Proteins erstreckt; * 

- der partiellen Aminosauresequenz, welche sich zwischen Aminosaure No. 196 und Aminosaure 
No. 217 gp41 -Proteins erstreckt; und 

- der partiellen Aminosauresequenz, welche sich zwischen Aminosaure No. 30 und Aminosaure 
No. 181 des gp120-Proteins erstreckt. 

45 

19. Synthetisches Polypeptid nach Anspruch 18, welches mindestens eine der in den Figuren 8, 9A, 9B, 
10, 11 A, 11B, 12A und 12B dargestellten Aminosauresequenzen, entsprechend einem Teil der XHAT-3- 
Sequenz, enthalt. 

so 20. Eine Rekombinante DNA, dadurch gekennzeichnet, dass sie eine DNA-Sequenz enthalt, welche fGr die 
Erzeugung eines Polypeptides gemass Anspruch 18 Oder 19 kodiert. 

21. Ein Plasmid, dadurch gekennzeichnet, dass es eine Rekombinante DNA nach Anspruch 20 enthalt. 

55 22. Wirtzelle, dadurch gekennzeichnet, dass sie durch das Plasmid nach Anspruch 21 transformiert wird. 

23. Das HTLV-lll-lsolat-XHAT-3, enthaltend genomische DNA aus HTLV-lll-lsolat-HAT-3-Virus, hinterlegt bei 
der ATCC unter No. 40213. 
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24. Ein Plasmid nach Anspruch 21, dadurch gekennzeichnet, dass es von HTLV-lll-lsolat nach Anspruch 23 
abgeleitet ist. 



Das Plasmid nach Anspruch 24, ausgewahlt aus der Gruppe bestehend aus p24-10SB.3, pENVKPN45, 
pGAG8, p2-10SB.5, p28-23EB.4 und p28-EB.1 . 

Das Plasmid nach Anspruch 24, ausgewahlt aus der Gruppe bestehend aus pENVPSTI , pENVPSTH3, 
pENVPSTH3 1-2, pENVPSTH3 2-10, p2-10 EB.3, pENVKPN, pENVKPN24, pENVKPNH3 24, 
pENVKPNH3 24-10, pENVKPN 28, pENVKPNH3 28-23, pENVKPNH3 28-24 und pENVKPNH3 28-45. 

Ein Verfahren zur Herstellung des Plasmides nach Anspruch 21 , umfassend : 

(a) die Einfuhrung einer cDNA-Sequenz aus HTLV-III in ein Plasmid zur Transformierung eines aus 
Zellen bestehenden Wirtes; 

(b) Entfernung eines Teils einer identifizierbaren, exprimierbaren Sequenz aus dem Plasmid von 
Stufe (a), urn eine ausser-Phasen-Rahmenverschiebung fur die Sequenz zu bewirken: 

(c) Transformieren eines aus Zellen bestehenden Wirtes mit dem Plasmid aus Stufe (b); 

(d) Zuchtung des transrormierten Wirtes; 

(e) Auswahl der Wirtzellen, welche spontane Mutationen aufweisen, welche eine in-Phasen-Rahmen- 
verschiebung fur die identifizierbare Sequenz der Stufe (b) bewirken, 

(f) Zuchtung der in Stufe (e) ausgewahlten Wirtzelle; 

(g) Auswahl der Wirtzelle, welche nutzliche Mengen an Polypeptiden erzeugt, welche mit HTLV-III- 
AntikSrpern wirksam immunologisch reaktionsfahig sind; und 

(h) Isolierung und Identifizierung von Plasmiden, welche fur die Polypeptide der Stufe (g) kodieren. 

Ein Polypeptid, dadurch gekennzeichnet, dass es durch Wirtzellen exprimiert wird, welche mit einem 
Plasmid nach Anspruch 25 Oder 26 transformiert sind. 

Eine Testgarnitur zur Verwendung in einem immunologischen Test, dadurch gekennzeichnet, dass sie 
ein immunochemisches Reagens nach einem der Ansprtiche 1 bis 17 enthalt. 

Revendications 

1. Reactif immunochimique caracterise par le fait qu'il comprend une association de deux ou plusieurs 
sequences polypeptidiques synthStiques choisies parmi : 

35 - une sequence polypeptidique contenant au moins un determinant antigenique de I'antigene de 

gag de I'isolat XHAT-3 de HTLV-III ; 

- une sequence polypeptidique contenant au moins un determinant antigenique de la glycoproteine 
gp120 de I'isolat XHAT-3 de HTLV-III ; et 

- une sequence polypeptidique contenant au moins un determinant antigenique de la glycoproteine 
40 gp41 de I'isolat XHAT-3 de HTLV-III ; 

ledit isolat comprenant un ADN genomique provenant du virus HAT-3, isolat de HTLV-III, depose a 
ATCC sous le N° 40213. 

2. Reactif immunochimique selon la revendication 1 , caracterise par le fait qu'il comprend un polypeptide 
45 correspondant au polypeptide carboxyterminal long d'environ 58 acides amines de la proteine gp120. 

3. Reactif immunochimique selon la revendication 2, caracterise par le fait qu'il comprend un polypeptide 
ayant des proprietes immunochimiques comparables a celles d'un polypeptide forme des sequences 
representees sur la Figure 9A. 

50 

4. Reactif immunochimique selon la revendication 1 , caracterise par le fait qu'il comprend un polypeptide 
correspondant au polypeptide carboxyterminal long d'environ 38 acides amines de la proteine gp120. 

5. Reactif immunochimique selon la revendication 4, caracterise* par le fait qu'il comprend un polypeptide 
55 ayant des proprietes immunochimiques comparables a celles de Tun quelconque des polypeptides 

forme des sequences representees sur la Figure 9B. 
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6. Reactif immunochimique selon Tune quelconque des revendications 1 a 5, caracterise par le fait qu'il 
comprend un polypeptide ayant des proprietes immunochimiques comparables a ceiles de la sequence 
d'acides amines partielle s'etendant entre I'acide amine N° 53 et I'acide amine N° 122 de la proline 
gp41. 

7. Reactif immunochimique selon la revendication 6, caracterise par le fait qu'il comprend un polypeptide 
ayant des proprietes immunochimiques comparables a celles d un polypeptide forme des sequences 
representees sur la Figure 10, correspondant aux acides amines N° 571 a 640 de la sequence de 
HTLV-III. 

8. Reactif immunochimique selon Tune quelconque des revendications 1 a 7, caracterise par le fait qu'il 
comprend un polypeptide ayant des proprietes immunochimiques comparables a celles de ia sequence 
d'acides amines partielle s'etendant entre I'acide amine N° 123 et I'acide amine N° 172 de la proteine 
gp41. 

9. Reactif immunochimique selon la revendication 8, caracterise par le fait qu'il comprend un polypeptide 
ayant des proprietes immunochimiques comparables a celles d'un polypeptide forme des sequences 
representees sur la Figure 11 A, correspondant aux acides amines N° 641 a 690 de la sequence de 
HTLV-III. 

10. Reactif immunochimique selon I'une quelconque des revendications 1 a 7, caracterise par le fait qu'il 
comprend un polypeptide ayant des proprietes immunochimiques comparables a celles de la sequence 
d'acides amines partielle s'etendant entre I'acide amine N° 123 et I'acide amine N° 163 de la proteine 
gp41. 

11. Reactif immunochimique selon la revendication 10, caracterise par le fait qu'il comprend un polypeptide 
ayant des proprietes immunochimiques comparables a celles d'un polypeptide forme des sequences 
representees sur la Figure 11B, correspondant aux acides amines N° 641 a 681 de la sequence de 
HTLV-III. 

12. Reactif immunochimique selon I'une quelconque des revendications 1 k 11, caracterise par le fait qu'il 
comprend un polypeptide ayant des proprietes immunochimiques comparables a celles de la sequence 
d'acides amines partielle s'etendant entre I'acide amine N° 173 et I'acide amine N° 217 de la proteine 
gp41. 

13. Reactif immunochimique selon la revendication 12, caracterise. par le fait qu'il comprend un polypeptide 
ayant des proprietes immunochimiques comparables a celles d'un polypeptide forme des sequences 
representees sur la Figure 12A, correspondant aux acides amines N° 691 a 737 de la sequence de 
HTLV-III. 

14. Reactif immunochimique selon I'une quelconque des revendications 1 a 11, caracterise par le fait qu'il 
comprend un polypeptide ayant des proprietes immunochimiques comparables a celles de la sequence 
d'acides amines partielle s'etendant entre I'acide amine N° 196 et I'acide amine N° 217 de la proteine 
gp41. 

15. Reactif immunochimique selon la revendication 14, caracterise par le fait qu'il comprend un polypeptide 
ayant des proprietes immunochimiques comparables a celles d'un polypeptide forme des sequences 
representees sur la Figure 12B, correspondant aux acides amines N° 714 a 737 de la sequence de 
HTLV-III. 

16. Reactif immunochimique selon I'une quelconque des revendications 1 a 15, caracterise par le fait qu'il 
comprend un polypeptide ayant des proprietes immunochimiques comparables a celles de la sequence 
d'acides amines partielle s'etendant entre I'acide amine N° 30 et I'acide amine N° 181 de la proteine 
gp120. 

17. Reactif immunochimique selon la revendication 16, caracterise par le fait qu'il comprend un polypeptide 
ayant des proprietes immunochimiques comparables a celles d'un polypeptide forme des sequences 
representees sur la Figure 8. 
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18. Polypeptide synthetique comprenant au moins une sequence d'acides amines ayant les propri6t6s 
immunochimiques de la sequence d'acides amines de I'isolat XHAT-3 de HTLV-III, ledit isolat compre- 
nant un ADN ggnomique provenant du virus HAT-3, isolat de HTLV-III, d£pos£ a ATCC sous le N° 
40213, ce polypeptide etant choisi parmi : 
5 - le polypeptide carboxyterminal long de 38 acides amines de la proteine gp120 ; 

- la sequence d'acides amines partielle s'etendant entre I'acide amin£ N° 53 et I'acide amine N° 
122 de la proline gp41 ; 

- la sequence d'acides amines partielle s'etendant entre I'acide amine N° 123 et I'acide amine N° 
163 de la proteine gp41 ; 

10 - la sequence d'acides amines partielle s'etendant entre I'acide amine N° 196 et I'acide amine N° 

217 de la proline gp41 ; et 

- la sequence d'acides amines partielle s'etendant entre I'acide amine N° 30 et I'acide amine N° 
181 de la proteine gp120. 

15 19. Polypeptide synthetique selon la revendication 18, comprenant au moins I'une des sequences d'acides 
amines representees sur les Figures 8, 9A, 9B, 10, 11 A, 11 B, 12A et 12B, correspondant a une partie 
de la sequence de XHAT-3. 

20. ADN recombinant caracterise par le fait qu'il comprend une sequence d'ADN codant pour la production 
20 d'un polypeptide selon la revendication 18 ou 19. 

21. Plasmide caracterise par le fait qu'il comprend un ADN recombinant selon la revendication 20. 

22. Cellule-hdte caracterisee par le fait qu'elle est transformee par le plasmide selon la revendication 21. 

25 

23. Isolat XHAT-3 de HTLV-III, comprenant un ADN g^nomique provenant du virus HAT-3, isolat de HTLV- 
III, depose a ATCC sous le N° 40213. 

24. Plasmide selon la revendication 21, caracterise par le fait qu'il est derive de I'isolat de HTLV-III de la 
30 revendication 23. 

25. Plasmide de la revendication 24, choisi dans le groupe forme par p24-10SB.3, pENVKPN45, pGAG8, 
P2-10SB.5, p28-23EB.4 et p28-EB.1 . 

35 26. Plasmide de la revendication 24, choisi dans le groupe forme par pENVPSTI , pENVPSTH3, 
pENVPSTH3 1-2, pENVPSTH3 2-10, p2-10 EB.3, pENVKPN, pENVKPN24, pENVKPNH3 24, 
pENVKPNH3 24-10, pENVKPN 28, pENVKPNH3 28, pENVKPNH3 28-23, pENVKPNK3 28-24 et 
pENVKPNH3 28-45. 

40 27. Precede de preparation du plasmide de la revendication 21 , consistant a : 

(a) insurer une sequence d'ADNc provenant de HTLV-III dans un plasmide destine a transformer un 
note cellulaire ; 

(b) effectuer dans le plasmide de I'etape (a) la deletion d'une portion d'une sequence identifiable 
capable d'expression, de fagon a produire un decalage de cadre avec .dephasage de cette sequence 

45 ;" 

(c) transformer un note cellulaire avec le plasmide de I'etape (b) ; 

(d) cultiver I'hote transform^ ; 

(e) selectionner les cellules-notes presentant des mutations spontanees qui realisent un decalage de 
cadre avec mise en phase pour la sequence identifiable de I'etape (b) ; 

so (f) cultiver la cellule-hote s£lectionn£e dans I'etape (e) ; 

(g) selectionner la cellule-hote qui produit des quantites utiles de polypeptides ayant une immuno- 
reactivite efficace avec des anticorps anti-HTLV-lll ; et 

(h) isoler et identifier les piasmides qui codent pour les polypeptides de I'etape (g). 

55 2a Polypeptide caracterise par le fait qu'il est exprime par une cellule-hote transformee par un plasmide 
selon la revendication 25 ou 26. 
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29. Necessaire d'essai a utiliser dans une analyse immunologique, caractdrise par le fait qu'il comprend un 
reactif immunochimique selon Tune quelconque des revendications 1 & 17. 
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Figure 2.1 

GAG-1 

SEQUENCE OF HAT3 GAG GENE 

10 20 30 40 50 

* * * * * 

CCT ATA GTG CAG AAC CTT CAG GGG CAA ATG GTA CAT CAA GCC ATA TCA CCT AGA ACT 
Pro He Val Gin Asn Leu Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg Thr 

1 10 

60 70 80 90 100 110 

* * * * *' * 

TTA AAT GCA TGG GTA AAA GTA GTA GAA GAG AAG GCT TTT AGC CCA GAA GTA ATA CCC 
Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val lie Pro 

* * 

20 30 

120 130 140 150 160 170 

* * * * * * 

ATG TTT TCA GCA TTA TCA GAA GGA GCC ACC CCA CAA GAT TTA AAC ACC ATG CTA AAC 
Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn 

* • * 
40 50 

180 190 200 210 220 

* * * * * 

ACA GTG GGG GGA CAT CAA GCA GCC ATG CAA ATG TTA AAA GAG ACT ATC AAT GAG GAA 
Thr Val Gly Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu 

* * 

60 70 

230 240 250 260 270 280 

* * * * * * 

G CT GCA GA A TGG GAT AGA GTA CAT CCA GTG CAT GCA GGG CCT ATT GCA CCA CGT CAG 
Ala Ala Glu Trp Asp Arg VaT His Pro Val His Ala Gly Pro He Ala Pro Arg Gin 

80 go 

290 300 310 320 330 340 

* * * * * * * 
ATG AGA GAA CCA AGG GGA AGT GAC ATA GCA GGA ACT ACT AGT ACC CTT CAG GAA CAA 
Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin 

* * 

100 no 

350 360 370 380 390 

* * * * * 

ATA GGA TGG ATG ACA AAT AAT CCA CCT ATC CCA GTA GGA GAA ATC TAT AAA AGG TGG 
He Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys Arg Trp 

120 130 
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Figure 2.2 



GAG -2 



400 
* 



410 



420 
* 



430 



440 



450 



ATA ATT CTG GGA TTA AAT AAA ATA GTA AGA ATG TAT AGC CCC ATC AGC ATT CTG GAC 
lie lie Leu Gly Leu Asn Lys lie Val Arg Met Tyr Ser Pro lie Ser He Leu Asp 



140 



150 



460 



470 



480 



490 
* 



500 



510 



ATA AGA CAA GGA CCT AAG GAA CCC TTT AGA GAC TAT GTA GAC CGG TTC TAT AAA ACT 
lie Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr 



160 



170 



520 



530 



540 



550 
* 



560 



570 



CTA AGA GCC GAG CAA GCT TCA CAG GAT GTA AAA AAT TGG ATG ACA GAA ACC TTG CTG 
Leu Arg Ala Glu Gin Ala Ser Gin Asp Val Lys Asn Trp Met Thr Glu Thr Leu Leu 



180 



190 



580 



590 
* 



600 



610 
* 



620 
* 



GTC CAA AAT GCG AAC CCA GAT TGT AAA ACT ATT TTA AAA GCA TTG GGA CCA GCA GCT 

Val Gin Asn Ala Asn Pro Asp Cys Lys Thr lie Leu Lys Ala Leu Gly Pro Ala Ala 

* 

200 



630 
* 



640 



650 
* 



660 
* 



670 



680 
* 



ACA CTA GAA GAA ATG ATG ACA GCA TGT CAG GGA GTA GGG GGA CCC AGC CAT AAA GCA 
Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Ser His Lys Ala 



210 



220 



690 

" * 



700 
* 



710 

* 



720 



730 



740 



AGA ATT TTG GCT GAA GCA ATG AGC CAA GTA ACA AAT TCA GCT ACC ATA ATG CTG CAG 
Arg lie Leu Ala Glu Ala Met Ser Gin Val Thr Asn Ser Ala Thr He Met Leu Gin 



230 



240 



750 
* 



760 

* 



770 



AAA GGT AAT TTT AGG GAC CAA AGA AAA ATT GTT AAG 
Lys Gly Asn Phe Arg Asp Gin Arg Lys He Val Lys 



260 



269 
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Figure 3.1 



ENV-1 



SEQUENCE OF HAT3 ENV GENE 

10 20 30 40 50 

* * * * * 

ATG AGA GTG ATG GAG ATG AGG AAG AAT TGT CAG CAC TTG TGG AAA TGG GGC ACC ATG 
Met Arg Val Met Glu Met Arg Lys Asn Cys Gin His Leu Trp Lys Trp Gly Thr Met 

* * 

1 10 

60 70 80 90 100 110 

* * * * * * 

CTC CTT GGG ATG TTG ATG ATC TGT AGT G CT GCA G AG GAC TTG TGG GTC ACA GTC TAT 
Leu Leu Gly Met Leu Met He Cys Ser Ala Ala Glu Asp Leu Trp Val Thr Val Tyr 

* * 

20 30 

120 130 140 150 160 170 

* * * * * * 

TAT GG G GTA CC T GTG TGG AAA GAA GCA ACC ACC ACT CTA TTT TGT GCA TCA GAA GCT 
Tyr Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Glu Ala 

* * 
40 50 

180 190 200 210 220 

* * * * * 

AAA GCA TAT AAA ACA GAG GTA CAT AAT GTC TGG GCC AAA CAT GCT TGT GTA CCT ACA 
Lys Ala Tyr Lys Thr Glu Val His Asn Val Trp Ala Lys His Ala Cys Val Pro Thr 

* * 
60 70 

230 240 250 260 270 280 

* * * * * * 

GAC CCC AAC CCA CAA GAA GTA CTA TTG GAA AAT GTG ACA GAA AAT TTT AAC ATG TGG 
Asp Pro Asn Pro Gin Glu Val Leu Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp 
* * 

80 90 

290 300 310 320 330 340 

* * * * * * 

AAA AAT AAC ATG GTA GAA CAG ATG CAT GAG GAT ATA ATC AGT TTA TGG GAT CAA AGC 

Lys Asn Asn Met Val Glu Gin Met His Glu Asp He He Ser Leu Trp Asp Gin Ser 
* * 
100 no 

350 360 370 380 390 

* * * * * 

CTA AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC ACT GAT GCT 
Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Ala 
* * 

120 130 
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Figure 3,2 

ENV-2 

400 410 420 430 440 450 

* * ★ * * * 

AAC TTG AAT GGT ACT AAT GTC ACT AGT AGT AGC GGG GGA ACA ATG ATG GAG AAC GGA 

Asn Leu Asn Gly Thr Asn Val Thr Ser Ser Ser Gly Gly Thr Met Met Glu Asn Gly 

* * 

140 150 

460 470 480 490 500 510 

* * * * * * 

GAA ATA AAA AAC TGC TCT TTC CAG GTC ACC ACA AGT AGA AGA GAT AAG ACG CAG AAA 
Glu He Lys Asn Cys Ser Phe Gin Val Thr Thr Ser Arg Arg Asp Lys Thr Gin Lys 

160 170 

520 530 '540 550 560 570 

* * * * * * 

AAA TAT GCA CTT TTT TAT AAA CTT GAT GT G GTA CC A ATA GAG AAG GGT AAT ATT AGC 

Lys Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro lie Glu Lys Gly Asn He Ser 

* * 

180 190 

580 590 600 610 620 

* * * * * 

CCT AAG AAT AAT ACT AGC AAT AAT ACT AGC TAT GGT AAC TAT ACA TTG ATA CAT TGT 
Pro Lys Asn Asn Thr Ser Asn Asn Thr Ser Tyr Gly Asn Tyr Thr Leu lie His Cys 

* 

200 

630 640 650 660 670 680 

* * * * * * 

AAT TCC TCA GTC ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT CCC ATA, 
Asn Ser Ser Val He Thr Gin Ala Cys Pro Lys Val Ser Phe Glu Pro He Pro He 

* * 

210 220 

690 700 710 720 730 740 

* * * * * * 

CAT TAT TGC ACC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG AAG TTC AAT 

His Tyr Cys Thr Pro Ala Gly Phe Ala He Leu Lys Cys Asn Asp Lys Lys Phe Asn * 
* * 
230 240 

750 760 770 780 790 

* * * * * 

GGA ACA GGA CCA TGT AAA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AGG CCA 
Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro 
* * 

250 260 

800 810 820 830 840 850 

* * * * * * 

GTA GTG TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG GTA GTA ATT 
Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val He 
* * 

270 280 
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Figure 3.3 

ENV - 3 

860 870 880 890 900 910 

* * * * * * 
AGA TCT GAA AAT TTC ACG GAC AAT GTT AAA ACC ATA ATA GTA CAG CTG AAT GCA TCT 
Arg Ser Glu Asn Phe Thr Asp Asn Val Lys Thr lie He Val Gin Leu Asn Ala Ser 

* * 

920 930 940 950 960 

* * * * * 

GTA CAA ATT AAT TGT ACA AGA CCC AAC AAC AAT ACA AGA AAA AGT ATA ACT AAG GGA 
Val Gin He Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Thr Lys Gly 
* * 

310 320 

970 980 990 1000 1010 1020 

* * * * * * 

CCA GGG AGA GTA ATT TAT GCA ACA GGA CAA ATA ATA GGA GAT ATA AGA AAA GCA CAT 
Pro Gly Arg Val lie Tyr Ala Thr Gly Gin He He Gly Asp He Arg Lys Ala His 

* * • 

330 340 

1030 1040 1050 1060 1070 1080 

* * * * * * 

TGT AAC CTT AGT AGA GCA CAA TGG AAT AAC ACT TTA AAA CAG GTA GTT ACA AAA TTA 

Cys Asn Leu Ser Arg Ala Gin Trp Asn Asn Thr Leu Lys Gin Val Val Thr Lys Leu 

* * 
350 360 

1090 1100 1110 1120 1130 1140 

* * * * * * 

AGA GAA CAA TTT GAC AAT AAA ACA ATA GTC TTT ACT TCA TCC TCA GGA GGG GAC CCA 
Arg Glu Gin Phe Asp Asn Lys Thr He Val Phe Thr Ser Ser Ser Gly Gly Asp Pro 

* * 
370 380 

1150 1160 1170 1180 1190 

* * * * * 

GAA ATT GTA CTT CAC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA 
Glu He Val Leu His Ser Phe Asn Cys Glly Gly Glu Phe Phe Tyr Cys Asn Thr Thr 

390 

1200 1210 1220 1230 1240 1250 

* * * * * * 

CAA CTG TTT AAT AGT ACT TGG AAT AGT ACT GAA GGG TCA AAT AAC ACT GGA GGA AAT 
Gin Leu Phe Asn Ser Thr Trp Asn Ser Thr Glu Gly Ser Asn Asn Thr Gly Gly Asn 

400 410 

1260 1270 1280 1290 1300 1310 

* * * * * * 
GAC ACA ATC ACA CTC CCA TGC AGA ATA AAA CAA ATT GTA AAC ATG TGG CAG GAA GTA 
Asp Thr He Thr Leu Pro Cys Arg He Lys Gin He Val Asn Met Trp Gin Glu Val 



420 



430 
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Figure 3,4 

ENV-4 

1320 1330 1340 1350 1360 

* ★ * * * 

GGA AAA GCA ATG TAT GCC CCT CCC ATC AGT GGA CAA ATT AAA TGT ATA TCA AAT ATT 
Gly Lys Ala Met Tyr Ala Pro Pro He Ser Gly Gin He Lys Cys He Ser Asn lie 
* * 

440 450 

1370 1380 1390 1400 1410 1420 

* * * * * * 

ACA GGG CTA CTA TTA ACA AGA GAT GGG GGT GAA GAT ACA ACT AAT ACT ACA GAG ATC 

Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Asp Thr Thr Asn Thr Thr Glu He 
* * 

460 470 



1430 1440 1450 1460 1470 1480 

* * * * * * 

ITC AGA CTT GGA GGA GGA AAT ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT AAA TAT 
Phe Arg Leu Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 
* * ■ 

480 490 

1490 1500 1510 1520 1530 

* * * * * 

AAA GTG GTA AGA ATT GAA CCA TTA GGA GTA GCA CCC ACT AGG GCA AAG AGA AGA GTG 
Lys Val Val Arg He Glu Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val 
* * 

500 510 



1540 1550 1560 1570 1580 1590 

* * * * * * 

GTG CAA AGA GAA AAA AGA GCA GTG GGA ACA ATA GGA GCT ATG TTC CTT GGG TTC TTG 

Val Gin Arg Glu Lys Arg Ala Val Gly Thr He Gly Ala Met Phe Leu Gly Phe Leu 

* * 

520 530 

1600 1610 1620 1630 1640 1650 

* * * * * * 

GGA GCA GCA GGA AGC ACT ATG GGC GCA GGC TCA ATA ACG CTG ACG GTA CAG GCC AGA 

Gly Ala Ala Gly Ser Thr Met Gly Ala Gly Ser He Thr Leu Thr Val Gin Ala Arg 

* * 
540 550 



1660 1670 1680 1690 1700 1710 

* * * * * * 

CAC TTA TTG TCT GGT ATA GTG CAA CAG CAA AAC AAT TTG CTG AGG GCT ATT GAG GCG 
His Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 

* * 

560 570 



1720 1730 1740 1750 1760 

* * * * * 

CAA CAG CAT CTG TTG CAA CTC ACG GTC TGG GGC ATC AAA CAG CTC CAG GCA AGA GTC 
Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val 

580 
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AA AGA TAC CTA AGG GAT CAA CAG CTC CTA GGA ATT TGG GGA TGC TCT 
lu Arg Tyr Leu Arg Asp Gin Gin Leu Leu Gly He Trp Gly Cys Ser 



600 



1840 
* 



1850 
* 



1860 
* 



1870 
* 



1880 
* 



TT TGC ACC ACT ACT GTG CCT TGG AAT GCT AGT TGG AGT AAT AAA TCT 
le Cys Thr Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser 



620 



1900 
* 



1910 



1920 
* 



1930 

• * ' 



TT TGG AAT AAC ATG ACC TGG ATG CAG TGG GAA AGA GAA ATT GAC AAT 
le Trp Asn Asn Met Thr Trp Met Gin Trp Glu Arg Glu He Asp Asn 



640 



>50 



1960 
* 



1970 
* 



1980 
* 



1990 
* 



A ATA TAC AAC TTA CTT GAA GAA TCG CAG AAC CAG CAA GAA AAG AAT 
e He Tyr Asn Leu Leu Glu Glu Ser Gin Asn Gin Gin Glu Lys Ash 



iO 



660 



2010 
* 



2020 
* 



2030 

* 



2040 
* 



2050 



A TTG GAA TTG GAT AAA TGG GCA AAT TTG TGG AAT TGG TTT GAC ATA 
u Leu Glu Leu Asp Lys Trp Ala Asn Leu Trp Asn Trp Phe Asp He 



670 



680 



2070 



2080 
* 



2090 
* 



2100 



G TGG TAT ATA AGA ATA. TTC ATA ATG ATA GTA GGA GGC TTG GTA GGT 
u Trp Tyr He Arg He Phe He Met He Val Gly Gly Leu Val Gly 



690 



700 



2130 
* 



2140 
* 



2150 



2160 
* 



J" TTT GCT GTG CTT TCT ATA GTG AAT AGA GTT AGG CAG GGA TAC TCA 
I Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser 



710 



720 



2180 

* - 



2190 
* 



2200 

* 



2210 



2220 



CAG ACC CAC CTC CCA GCC CCG AGG GGA CCC GAC AGG CCC GAA GGA 
■ Gln Tnr His Leu Pro Ala Pro Arg Gly Pro Asp Arg Pro Glu Gly 



730 



740 
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LIMV-0 

2240 2250 2260 2270 2280 

* * * * * 

GGT GGA GAG AGA GAC AGA GAC AGA TCC GGC GGT GCA GTG AAT GGA 
Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Gly Ala Val Asn Gly 
* * 

750 760 

2300 2310 2320 2330 

* * * * 

ATC TGG GAC GAT CTG TGG ACC CTG TGC AGC TTC AGO TAC CAC CGC 
He Trp Asp Asp Leu Trp Thr Leu Cys Ser Phe Ser Tyr His Arg 

770 

^50 2360 2370 2380 2390 

* * * * * 

CTC TTG ATA GTA GTG AGG ATT GTG GAA CTT CTG GGA CGC AGG GGG 
Leu Leu He Val Val Arg He Val Glu Leu Leu Gly Arg Arg Gly 

790 

2410 2420 2430 2440 2450 

* * * * ★ 

AAG TAT TGG TGG AAT CTC CTG CAG TAT TGG AGT CAG GAG CTA AAG 
Lys Tyr Trp Trp Asn Leu Leu Gin Tyr Trp Ser Gin Glu Leu Lys 

* 

810 

2470 2480 2490 2500 

* * * * 

AGC TTG CTT AAT ACC ACA GCA ATA GCA GTA GCT GAA GGG ACA GAT 
Ser Leu Leu Asn Thr Thr Ala He Ala Val Ala Glu Gly Thr Asp 

830 

2530 2540 2550 2560 

* * * * 

CAA AGA ATT CTT AGA GCT TTT CTT CAC ATA CCT AGA AGA 
Gin Arg He Leu Arg Ala Phe Leu His He Pro Arg Arg 

850 

?580 2590 
* * 

TA GAA AGG GCT TTG CTG 
eu Glu Arg Ala Leu Leu 

* * 

'.60 865 



3TA GCA 
/al Ala 



28 



EP 0 227 169 B1 




29 



EP 0 227 169 B1 




30 



EP 0 227 169 B1 



2 




CL 


1 






> 


m 


2 




UJ 




CL 







31 



EP 0 227 169 B1 



Figure 7 
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Figure 8-1 > 

HAT-3 Ala Glu Asp Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val 

HTLV-III Thr Lys — 

ARV-2 Thr Lys 

LAV Thr Lys : : 

Trp Lys Glu Ala Thr Thr Thr Uu Phe Cys Ala Ser Glu Ala Lys Ala 

Aflp Arg 

- ASp Lyg 

Tyr Lys Thr Glu Val His Asn Val Trp Ala Lys His Ala Cys Val Pro 

Asp-- Thr 

Asp Thr 

Asp Thr 

Thr Asp Pro Asn Pro Gin Glu Val Leu Leu Glu Asn Val Thr Glu Asn 
Val Val 

Val Gly 

Val yal 

Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met His Glu Asp lie 

Aan GU 

. _ Asp— His 

He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro 



Leu Cys Val Thr Leu Asn Cys Thr Aap Ala Asn Leu Asn Gly Thr Asn 

Ser Lys Leu Lys [ ] Asn Asp 

Thr Asn Leu [ 1 Gly Lys Ala 

Ser Lys Leu Gly [ ] Asn Ala 
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Figure 8-2 

Val Thr Ser Ser Ser Gly Gly Thr Met Met Glu Asn Gly [ ] [ ] [ ] 

Thr Asa Ser Gly Arg Met He Met Giu Lys Gly [ J [ J [ ] 

Thr Asn Asn Trp Lys Glu Glu He [ ] Lys Gly [][][] 

Thr Asn- Asn Thr Asn Ser Ser Ser Gly Glu Bfet Met Met Glu 

[ ] t J Glu He Lys Asn Cys Ser Phe Gin Val Thr Thr Ser Arg Arg 

( ] [ ] :— • Asn He Ser He 

[ ] [ ] Asn He Thr Ile« 

Lys Gly Asn lie Ser— He 

Asp Lys Thr Gin Lys Lys Tyr Ala Leu Phe Tyr Lys Leu Asp Val 

Gly Val Glu Tyr Phe Tyr Lys lie 

Asp lie Glu Asn Leu Arg Asn Vai 

Gly Val Glu Tyr Phe Tyr Lys He 
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Figure 9A 



HAT-3 Arg Asp Gly Gly Gin Asp Thr Thr Asn Thr Thr Glu He Phe 

HTLV-III Asn Ser Asn [ ] Glu Ser He 

ARV-2 Thr Asn Val Thr Asp Thr Val 

LAV — Asn Asn Asn [ ] Glu Ser He 

Arg Leu Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 

Pro Asp: 

Pro Asp 

Pro Asp ■ 



Lys Tyr Lys Val Val Arg He Glu Pro Leu Gly Val Ala Pro Thr Arg 

Val L ys ^val Lys 

H e Ly S xie Lys 

Val Ly8 v a i Lys 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala 



Figure 9B 

HAT-3 Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 

HTLV-III 

ARV-2 

LAV 

Lys Tyr Lys Val Val Arg He Glu Pro Leu Gly Val Ala Pro Thr Arg 

_ H e Lys 7 He Lys 

Val Lys Val Lys 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala 
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Figure 10 

HAT-3 His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 

HTLV-III . 

ARV-2 . 

Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gin Gin Leu Leu 

He —Lys 

— Val Arg 

Gly lie Trp Gly Cys Ser Gly Lys Leu He Cys Thr Thr Thr Val Pro 
„ AXa 

Trp Aan Ala Ser Trp Ser Asn Lys Ser Leu Asn Met He Trp Asn Asn 

Glu Gin : — -Asn 

Asp 

— - — — -Glu Gin — —Asn— — 

Met Thr Trp Met Gin Trp Glu Arg 
Gin -Glu 



36 



EP0 227 169 B1 



Figure 11A 

HAT* 3 Glu He Asp Asn Tyr Thr Gly He He Tyr Asn Leu Leu Glu 

HTLV-III Asn Ser Leu His Ser lie 

ARV-2 — Asp Asn Thr Tyr Thr Leu 

LAV Asn Ser Leu His Ser- He 

Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu 



Asp Lys Trp Ala Asn Leu Trp Asn Trp Phe Asp He Thr Gin Trp Leu 
Trp Tyr He Arg 

— Lys 

-Lys 



Figure 11B 

HAT- 3 Glu He Asp Asn Tyr Thr Gly He He Tyr Asn Leu Leu Glu 

HTLV-III Asn Ser Leu His Ser He 

ARV-2 Asp Asn Thr Tyr Thr Leu 

LAV Asn —Ser Leu His Ser He 

Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu 



Asp Lys Trp Ala Asn Leu Trp Asn Trp Phe Asp 
Ser Afln 

Ser Asn 
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Pigure 12A 

HAT-3 He Phe lie Met He Val Gly Gly Leu Val Gly Leu Lys He Val 

ARV-2 He ™ — Arg 

LAV lie--; Arg 

Phe Ala Val Leu Ser lie Val Asn Arg Val Arg Gin Gly Tyr Ser Pro 



Leu Ser Phe Gin Thr His Leu Pro Ala Pro Arg Gly Pro Asp Arg Pro 

. -His He 

Arg™ Val 



-His- 



Thr— 



Figure 12B 

HAT-3 Arg Val Arg Gin Gly Tyr Ser Pro 
ARV-2 

LAV 

Leu Ser Phe Gin Thr His Leu Pro Ala Pro Arg Gly Pro Asp Arg Pro 

His He 

Arg val 

His Thr 
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